Production of peptides in plants as viral coat protein fusions

ABSTRACT

The present invention relates to foreign peptide sequences fused to recombinant plant viral structural proteins and a method of their production. Fusion proteins are economically synthesized in plants at high levels by biologically contained tobamoviruses. The fusion proteins of the invention have many uses. Such uses include use as antigens for inducing the production of antibodies having desired binding properties, e.g., protective antibodies, or for use as vaccine antigens for the induction of protective immunity, including immunity against parasitic infections.

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] The present application is a continuation application of U.S. patent application Ser. No. 08/324,003, filed Oct. 14, 1994, which is a continuation-in-part of U.S. patent application Ser. No. 08/176,414, filed on Dec. 29, 1993, and which is a continuation-in-part of U.S. patent application Ser. No. 07/997,733, filed Dec. 30, 1992.

FIELD OF THE INVENTION

[0002] The present invention relates to the field of genetically engineered peptide production in plants, more specifically, the invention relates to the use of tobamovirus vectors to express fusion proteins.

BACKGROUND OF THE INVENTION

[0003] Peptides are a diverse class of molecules having a variety of important chemical and biological properties. Some examples include; hormones, cytokines, immunoregulators, peptide-based enzyme inhibitors, vaccine antigens, adhesions, receptor binding domains, enzyme inhibitors and the like. The cost of chemical synthesis limits the potential applications of synthetic peptides for many useful purposes such as large scale therapeutic drug or vaccine synthesis. There is a need for inexpensive and rapid synthesis of milligram and larger quantities of naturally-occurring polypeptides. Towards this goal many animal and bacterial viruses have been successfully used as peptide carriers.

[0004] The safe and inexpensive culture of plants provides an improved alternative host for the cost-effective production of such peptides. During the last decade, considerable progress has been made in expressing foreign genes in plants. Foreign proteins are now routinely produced in many plant species for modification of the plant or for production of proteins for use after extraction. Animal proteins have been effectively produced in plants (reviewed in Krebbers et al., 1992).

[0005] Vectors for the genetic manipulation of plants have been derived from several naturally occurring plant viruses, including TMV (tobacco mosaic virus). TMV is the type member of the tobamovirus group. TMV has straight tubular virions of approximately 300×18 nm with a 4 nm-diameter hollow canal, consisting of approximately 2000 units of a single capsid protein wound helically around a single RNA molecule. Virion particles are 95% protein and 5% RNA by weight. The genome of TMV is composed of a single-stranded RNA of 6395 nucleotides containing five large ORFs. Expression of each gene is regulated independently. The virion RNA serves as the messenger RNA (mRNA) for the 5′ genes, encoding the 126 kDa replicase subunit and the overlapping 183 kDa replicase subunit that is produced by read through of an amber stop codon approximately 5% of the time. Expression of the internal genes is controlled by different promoters on the minus-sense RNA that direct synthesis of 3′-coterminal subgenomic mRNAs which are produced during replication (FIG. 1). A detailed description of tobamovirus gene expression and life cycle can be found, among other places, in Dawson and Lehto, Advances in Virus Research 38:307-342 (1991). It is of interest to provide new and improved vectors for the genetic manipulation of plants.

[0006] For production of specific proteins, transient expression of foreign genes in plants using virus-based vectors has several advantages. Products of plant viruses are among the highest produced proteins in plants. Often a viral gene product is the major protein produced in plant cells during virus replication. Many viruses are able to quickly move from an initial infection site to almost all cells of the plant. Because of these reasons, plant viruses have been developed into efficient transient expression vectors for foreign genes in plants. Viruses of multicellular plants are relatively small, probably due to the size limitation in the pathways that allow viruses to move to adjacent cells in the systemic infection of entire plants. Most plant viruses have single-stranded RNA genomes of less than 10 kb. Genetically altered plant viruses provide one efficient means of transfecting plants with genes coding for peptide carrier fusions.

SUMMARY OF THE INVENTION

[0007] The present invention provides recombinant plant viruses that express fusion proteins that are formed by fusions between a plan viral coat protein and protein of interest. By infecting plant cells with the recombinant plant viruses of the invention, relatively large quantities of the protein of interest may be produced in the form of a fusion protein. The fusion protein encoded by the recombinant plant virus may have any of a variety of forms. The protein of interest may be fused to the amino terminus of the viral coat protein or the protein of interest may be fused to the carboxyl terminus of the viral coat protein. In other embodiments of the invention, the protein of interest may be fused internally to a coat protein. The viral coat fusion protein may have one or more properties of the protein of interest. The recombinant coat fision protein may be used as an antigen for antibody development or to induce a protective immune response.

[0008] Another aspect of the invention is to provide polynucleotides encoding the genomes of the subject recombinant plant viruses. Another aspect of the invention is to provide the coat fusion proteins encoded by the subject recombinant plant viruses. Yet another embodiment of the invention is to provide plant cells that have been infected by the recombinant plant viruses of the invention.

BRIEF DESCRIPTION OF THE FIGURES

[0009]FIG. 1. Tobamovirus Gene Expression The gene expression of tobamoviruses is diagrammed.

[0010]FIG. 2. Plasmid Map of the TMV Transcription Vector pSNC004 The infectious RNA genome of the U1 strain of TMV is synthesized by T7 RNA polymerase in vitro from pSNC004 linearized with KpnI.

[0011]FIG. 3. Diagram of Plasmid Constructions Each step in the construction of plasmid DNAs encoding various viral epitope fusion vectors discussed in the examples is diagrammed.

[0012]FIG. 4. Monoclonal Antibody (NVS3) Binding to TMV291 The reactivity of NVS3 to the malaria epitope present in TMV29 1 is measured in a standard ELISA.

[0013]FIG. 5. Monoclonal Antibody (NYS1) Binding to TMV261 The reactivity of NYS1 to the malaria epitope present in TMV261 is measured in a standard ELISA.

DESCRIPTION OF THE SPECIFIC EMBODIMENTS

[0014] Definitions and Abbreviations

[0015] TMV: Tobacco mosaic tobamovirus

[0016] TMVCP: Tobacco mosaic tobamovirus coat protein

[0017] Viral Particles: High molecular weight aggregates of viral structural proteins with or without genomic nucleic acids

[0018] Virion: An infectious viral particle.

[0019] The Invention

[0020] The subject invention provides novel recombinant plant viruses that code for the expression of fusion proteins that consist of a fusion between a plant viral coat protein and a protein of interest. The recombinant plant viruses of the invention provide for systemic expression of the fusion protein, by systemically infecting cells in a plant. Thus by employing the recombinant plant viruses of the invention, large quantities of a protein of interest may be produced.

[0021] The fusion proteins of the invention comprise two portions: (i) a plant viral coat protein and (ii) a protein of interest. The plant viral coat protein portion may be derived from the same plant viral coat protein that serves a coat protein for the virus from which the genome of the expression vector is primarily derived, i.e., the coat protein is native with respect to the recombinant viral genome. Alternatively, the coat protein portion of the fusion protein may be heterologous, i.e., non-native, with respect to the recombinant viral genome. In a preferred embodiment of the invention, the 17.5 KDa coat protein of tobacco mosaic virus is used in conjunction with a tobacco mosaic virus derived vector. The protein of interest portion of the fusion protein for expression may consist of a peptide of virtually any amino acid sequence, provided that the protein of interest does not significantly interfere with (1) the ability to bind to a receptor molecule, including antibodies and T cell receptor (2) the ability to bind to the active site of an enzyme (3) the ability to induce an immune response, (4) hormonal activity, (5) immunoregulatory activity, and (6) metal chelating activity. The protein of interest portion of the subject fusion proteins may also possess additional chemical or biological properties that have not been enumerated. Protein of interest portions of the subject fusion proteins having the desired properties may be obtained by employing all or part of the amino acid residue sequence of a protein known to have the desired properties. For example, the amino acid sequence of hepatitis B surface antigen may be used as a protein of interest portion of a fusion protein invention so as to produce a fusion protein that has antigenic properties similar to hepatitis B surface antigen. Detailed structural and functional information about many proteins of interest are well known, this information may be used by the person of ordinary skill in the art so as to provide for coat fusion proteins having the desired properties of the protein of interest. The protein of interest portion of the subject fusion proteins may vary in size from one amino acid residue to over several hundred amino acid residues, preferably the sequence of interest portion of the subject fusion protein is less than 100 amino acid residues in size, more preferably, the sequence of interest portion is less than 50 amino acid residues in length. It will be appreciated by those of ordinary skill in the art that, in some embodiments of the invention, the protein of interest portion may need to be longer than 100 amino acid residues in order to maintain the desired properties. Preferably, the size of the protein of interest portion of the fusion proteins of the invention is minimized (but retains the desired biological/chemical properties), when possible.

[0022] While the protein of interest portion of fusion proteins of the invention may be derived from any of the variety of proteins, proteins for use as antigens are particularly preferred. For example, the fusion protein, or a portion thereof, may be injected into a mammal, along with suitable adjutants, so as to produce an immune response directed against the protein of interest portion of the fusion protein. The immune response against the protein of interest portion of the fusion protein has numerous uses, such uses include, protection against infection, and the generation of antibodies useful in immunoassays.

[0023] The location (or locations) in the fusion protein of the invention where the viral coat protein portion is joined to the protein of interest is referred to herein as the fusion joint. A given fusion protein may have one or two fusion joints. The fusion joint may be located at the carboxyl terminus of the coat protein portion of the fusion protein (joined at the amino terminus of the protein of interest portion). The fusion joint may be located at the amino terminus of the coat protein portion of the fusion protein (joined to the carboxyl terminus of the protein of interest). In other embodiments of the invention, the fusion protein may have two fusion joints. In those fusion proteins having two fusion joints, the protein of interest is located internal with respect to the carboxyl and amino terminal amino acid residues of the coat protein portion of the fusion protein, i.e., an internal fusion protein. Internal fusion proteins may comprise an entire plant virus coat protein amino acid residue sequence (or a portion thereof) that is “interrupted” by a protein of interest, i.e., the amino terminal segment the coat protein portion is joined at a fusion joint to the amino terminal amino acid residue of the protein of interest and the carboxyl terminal segment of the coat protein is joined at a fusion joint to the amino terminal acid residue of the protein of interest.

[0024] When the coat fusion protein for expression is an internal fusion protein, the fusion joints may be located at a variety of sites within a coat protein. Suitable sites for the fusion joints may be determined either through routine systematic variation of the fusion joint locations so as to obtain an internal fusion protein with the desired properties. Suitable sites for the fusion jointly may also be determined by analysis of the three dimensional structure of the coat protein so as to determine sites for “insertion” of the protein of interest that do not significantly interfere with the structural and biological functions of the coat protein portion of the fusion protein. Detailed three dimensional structures of plant viral coat proteins and their orientation in the virus have been determined and are publicly available to a person of ordinary skill in the art. For example, a resolution model of the coat protein of Cucumber Green Mottle Mosaic Virus (a coat protein bearing strong structural similarities to other tobamovirus coat proteins) and the virus can be found in Wang and Stubbs J. Mol. Biol. 239:371-384 (1994). Detailed structural information on the virus and coat protein of Tobacco Mosaic Virus can be found, among other places in Namba et al, J. Mol. Biol. 208:307-325 (1989) and Pattanayek and Stubbs J. Mol. Biol. 228:516-528 (1992).

[0025] Knowledge of the three dimensional structure of a plant virus particle and the assembly process of the virus particle permits the person of ordinary skill in the art to design various coat protein fusion s of the invention, including insertions, and partial substitutions. For example, if the protein of interest is of a hydrophilic nature, it may be appropriate to fuse the peptide to the TMVCP region known to be oriented as a surface loop region. Likewise, alpha helical segments that maintain subunit contacts might be substituted for appropriate regions of the TMVCP helices or nucleic acid binding domains expressed in the region of the TMVCP oriented towards the genome.

[0026] Polynucleotide sequences encoding the subject fusion proteins may comprise a “leaky” stop codon at a fusion joint. The stop codon may be present as the codon immediately adjacent to the fusion joint, or may be located close (e.g., within 9 bases) to the fusion joint. A leaky stop codon may be included in polynucleotides encoding the subject coat fusion proteins so as to maintain a desired ratio of fusion protein to wild type coat protein. A “leaky” stop codon does not always result in translational termination and is periodically translated. The frequency of initiation or termination at a given start/stop codon is context dependent. The ribosome scans from the 5′-end of a messenger RNA for the first ATG codon. If it is in a non-optimal sequence context, the ribosome will pass, some fraction of the time, to the next available start codon and initiate translation downstream of the first. Similarly, the first termination codon encountered during translation will not function 100% of the time if it is in a particular sequence context. Consequently, many naturally occurring proteins are known to exist as a population having heterogeneous N and/or C terminal extensions. Thus by including a leaky stop codon at a fusion joint coding region in a recombinant viral vector encoding a coat fusion protein, the vector may be used to produce both a fusion protein and a second smaller protein, e.g., the viral coat protein. A leaky stop codon may be used at, or proximal to, the fusion joints of fusion proteins in which the protein of interest portion is joined to the carboxyl terminus of the coat protein region, whereby a single recombinant viral vector may produce both coat fusion proteins and coat proteins. Additionally, a leaky start codon may be used at or proximal to the fusion joints of fusion proteins in which the protein of interest portion is joined to the amino terminus of the coat protein region, whereby a similar result is achieved. In the case of TMVCP, extensions at the N and C terminus are at the surface of viral particles and can be expected to project away from the helical axis. An example of a leaky stop sequence occurs at the junction of the 126/183 kDa reading frames of TMV and was described over 15 years ago (Pelham, H. R. B., 1978). Skuzeski et al. (1991) defined necessary 3′ context requirements of this region to confer leakiness of termination on a heterologous protein marker gene (β-glucuronidase) as CAR-YYA (C=cytidine, A=adenine, Y=pyrimidine).

[0027] In another embodiment of the invention, the fusion joints on the subject coat fusion proteins are designed so as to comprise an amino acid sequence that is a substrate for protease. By providing a coat fusion protein having such a fusion joint, the protein of interest may be conveniently derived from the coat protein fusion by using a suitable proteolytic enzyme. The proteolytic enzyme may contact the fusion protein either in vitro or in vivo.

[0028] The expression of the subject coat fusion proteins may be driven by any of a variety of promoters functional in the genome of the recombinant plant viral vector. In a preferred embodiment of the invention, the subject fusion proteins are expressed from plant viral subgenomic promoters using vectors as described in U.S. Pat. No. 5,316,931.

[0029] Recombinant DNA technologies have allowed the life cycle of numerous plant RNA viruses to be extended artificially through a DNA phase that facilitates manipulation of the viral genome. These techniques may be applied by the person ordinary skill in the art in order make and use recombinant plant viruses of the invention. The entire CDNA of the TMV genome was cloned and functionally joined to a bacterial promoter in an E. coli plasmid (Dawson et al., 1986). Infectious recombinant plant viral RNA transcripts may also be produced using other well known techniques, for example, with the commercially available RNA polymerases from T7, T3 or SP6. Precise replicas of the virion RNA can be produced in vitro with RNA polymerase and dinucleotide cap, m7GpppG. This not only allows manipulation of the viral genome for reverse genetics, but it also allows manipulation of the virus into a vector to express foreign genes. A method of producing plant RNA virus vectors based on manipulating RNA fragments with RNA ligase has proved to be impractical and is not widely used (Pelcher, L. E., 1982). Detailed information on how to make and use recombinant RNA plant viruses can be found, among other places in U.S. Pat. No. 5,316,931 (Donson et al.), which is herein incorporated by reference. The invention provides for polynucleotide encoding recombinant RNA plant vectors for the expression of the subject fusion proteins. The invention also provides for polynucleotides comprising a portion or portions of the subject vectors. The vectors described in U.S. Pat. No. 5,316,931 are particularly preferred for expressing the fusion proteins of the invention.

[0030] In addition to providing the described viral coat fusion proteins, the invention also provides for virus particles that comprise the subject fusion proteins. The coat of the virus particles of the invention may consist entirely of coat fusion protein. In another embodiment of the virus particles of the invention, the virus particle coat may consist of a mixture of coat fusion proteins and non-fusion coat protein, wherein the ratio of the two proteins may be varied. As tobamovirus coat proteins may self-assemble into virus particles, the virus particles of the invention may be assembled either in vivo or in vitro. The virus particles may also be conveniently dissassembled using well known techniques so as to simplify the purification of the subject fusion proteins, or portions thereof.

[0031] The invention also provides for recombinant plant cells comprising the subject coat fusion proteins and/or virus particles comprising the subject coat fusion proteins. These plant cells may be produced either by infecting plant cells (either in culture or in whole plants) with infections virus particles of the invention or with polynucleotides encoding the genomes of the infectious virus particle of the invention. The recombinant plant cells of the invention having many uses. Such uses include serving as a source for the fusion coat proteins of the invention.

[0032] The protein of interest portion of the subject fusion proteins may comprise many different amino acid residue sequences, and accordingly may different possible biological/chemical properties however, in a preferred embodiment of the invention the protein of interest portion of the fusion protein is useful as a vaccine antigen. The surface of TMV particles and other tobamoviruses contain continuous epitopes of high antigenicity and segmental mobility thereby making TMV particles especially useful in producing a desired immune response. These properties make the virus particles of the invention especially useful as carriers in the presentation of foreign epitopes to mammalian immune systems.

[0033] While the recombinant RNA viruses of the invention may be used to produce numerous coat fusion proteins for use as vaccine antigens or vaccine antigen precursors, it is of particular interest to provide vaccines against malaria. Human malaria is caused by the protozoan species Plasmodium falciparum, P. vivax, P. ovale and P. malariae and is transmitted in the sporozoite form by Anopheles mosquitos. Control of this disease will likely require safe and stable vaccines. Several peptide epitopes expressed during various stages of the parasite life cycle are thought to contribute to the induction of protective immunity in partially resistant individuals living in endemic areas and in individuals experimentally immunized with irradiated sporozoites.

[0034] When the fusion proteins of the invention, portions thereof, or viral particles comprising the fusion proteins are used in vivo, the proteins are typically administered in a composition comprising a pharmaceutical carrier. A pharmaceutical carrier can be any compatible, non-toxic substance suitable for delivery of the desired compounds to the body. Sterile water, alcohol, fats, waxes and inert solids may be included in the carrier. Pharmaceutically accepted adjuvants (buffering agents, dispersing agent) may also be incorporated into the pharmaceutical composition. Additionally, when the subject fusion proteins, or portion thereof, are to be used for the generation of an immune response, protective or otherwise, formulation for administration may comprise one or immunological adjuvants in order to stimulate a desired immune response.

[0035] When the fusion proteins of the invention, or portions thereof, are used in vivo, they may be administered to a subject, human or animal, in a variety of ways. The pharmaceutical compositions may be administered orally or parenterally, i.e., subcutaneously, intramuscularly or intravenously. Thus, this invention provides compositions for parenteral administration which comprise a solution of the fusion protein (or derivative thereof) or a cocktail thereof dissolved in an acceptable carrier, preferably an aqueous carrier. A variety of aqueous carriers can be used, e.g., water, buffered water, 0.4% saline, 0.3% glycerine and the like. These solutions are sterile and generally free of particulate matter. These compositions may be sterilized by conventional, well known sterilization techniques. The compositions may contain pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents and the like, for example sodium acetate, sodium chloride, potassium chloride, calcium chloride, sodium lactate, etc. The concentration of fusion protein (or portion thereof) in these formulations can vary widely depending on the specific amino acid sequence of the subject proteins and the desired biological activity, e.g., from less than about 0.5%, usually at or at least about 1% to as much as 15 or 20% by weight and will be selected primarily based on fluid volumes, viscosities, etc., in accordance with the particular mode of administration selected.

[0036] Actual methods for preparing parenterally administrable compositions and adjustments necessary for administration to subjects will be known or apparent to those skilled in the art and are described in more detail in, for example, Remington's Pharmaceutical Science, current edition, Mack Publishing Company, Easton, Pa, which is incorporated herein by reference.

[0037] The invention having been described above, may be better understood by reference to the following examples. The examples are offered by way of illustration and are not intended to be interpreted as limitations on the scope of the invention.

EXAMPLES

[0038] Biological Deposits

[0039] The following present examples are based on a full length insert of wild type TMV (U1 strain) cloned in the vector pUC18 with a T7 promoter sequence at the 5′-end and a KpnI site at the 3′-end (pSNC004, FIG. 2) or a similar plasmid pTMV304. Using the polymerase chain reaction (PCR) technique and primers WD29 (SEQ ID NO: 1) and D1094 (SEQ ID NO: 2) a 277 XmaI/HindIII amplification product was inserted with the 6140 bp XmaI/KpnI fragment from pTMV304 between the KpnI and HindIII sites of the common cloning vector pUC18 to create pSNC004. The plasmid pTMV304 is available from the American Type Culture Collection, Rockville, Md. (ATCC deposit 45138). The genome of the wild type TMV strain can be synthesized from pTMV304 using the SP6 polymerase, or from pSNC004 using the T7 polymerase. The wild type TMV strain can also be obtained from the American Type Culture Collection, Rockville, Md. (ATCC deposit No. PV135). The plasmid pBGC152, Kumagai, M., et al., (1993), is a derivative of pTMV304 and is used only as a cloning intermediate in the examples described below. The construction of each plasmid vector described in the examples below is diagrammed in FIG. 3.

Example 1

[0040] Propagation and purification of the U1 strain of TMV

[0041] The TMVCP fusion vectors described in the following examples are based on the U1 or wild type TMV strain and are therefore compared to the parental virus as a control. Nicotiana tabacum cv Xanthi (hereafter referred to as tobacco) was grown 4-6 weeks after germination, and two 4-8 cm expanded leaves were inoculated with a solution of 50 μg/ml TMV U1 by pipetting 100 μl onto carborundum dusted leaves and lightly abrading the surface with a gloved hand. Six tobacco plants were grown for 27 days post inoculation accumulating 177 g fresh weight of harvested leaf biomass not including the two lower inoculated leaves. Purified TMV U1 Sample ID No. TMV204.B4 was recovered (745 mg) at a yield of 4.2 mg of virion per gram of fresh weight by two cycles of differential centrifugation and precipitation with PEG according to the method of Gooding et al. (1967). Tobacco plants infected with TMV U1 accumulated greater than 230 micromoles of coat protein per kilogram of leaf tissue.

Example 2

[0042] Production of a malarial B-cell epitope genetically fused to the surface loop region of the TMVCP

[0043] The monoclonal antibody NVS3 was made by immunizing a mouse with irradiated P. vivax sporozoites. NVS3 mAb passively transferred to monkeys provided protective immunity to sporozoite infection with this human parasite. Using the technique of epitope-scanning with synthetic peptides, the exact amino acid sequence present on the P. vivax sporozoite surface and recognized by NVS3 was defined as AGDR (Seq ID No. P1). The epitope AGDR is contained within a repeating unit of the circumsporozoite (CS) protein (Charoenvit et al., 1991a), the major immunodominant protein coating the sporozoite. Construction of a genetically modified tobamovirus designed to carry this malarial B-cell epitope fused to the surface of virus particles is set forth herein.

[0044] Construction of plasmid pBGC291. The 2.1 kb EcoRI-PstI fragment from pTMV204 described in Dawson, W., et al. (1986) was cloned into pBstSK- (Stratagene Cloning Systems) to form pBGC11. A 0.27 kb fragment of pBGC11 was PCR amplified using the 5′ primer TB2ClaI5′ (SEQ ID NO: 3) and the 3′ primer CP.ME2+ (SEQ ID NO: 4). The 0.27 kb amplified product was used as the 5′ primer and C/0AvrII (SEQ ID NO: 5) was the 3′ primer for PCR amplification. The amplified product was cloned into the SmaI site of pBstKS+ (Stratagene Cloning Systems) to form pBGC243.

[0045] To eliminate the BstXI and SacII sites from the polylinker, pBGC234 was formed by digesting pBstKS+ (Stratagene Cloning Systems) with BstXI followed by treatment with T4 DNA Polymerase and self-ligation. The 1.3 kb HindIII-KpnI fragment of pBGC304 was cloned into pBGC234 to form pBGC235. pBGC304 is also named pTMV304 (ATCC deposit 45138).

[0046] The 0.3 kb PacI-AccI fragment of pBGC243 was cloned into pBGC235 to form pBGC244. The 0.02 kb polylinker fragment of pBGC243 (SmaI-EcoRV) was removed to form pBGC280. A 0.02 kb synthetic PstI fragment encoding the P. vivax AGDR repeat was formed by annealing AGDR3p (SEQ ID NO: 6) with AGDR3m (SEQ ID NO: 7) and the resulting double stranded fragment was cloned into pBGC280 to form pBGC282. The 1.0 kb NcoI-KpnI fragment of pBGC282 was cloned into pSNC004 to form pBGC291.

[0047] The coat protein sequence of the virus TMV291 produced by transcription of plasmid pBGC291 in vitro is listed in (SEQ ID NO: 16) The epitope (AGDR)3 is calculated to be approximately 6.2% of the weight of the virion.

[0048] Propagation and purification of the epitope expression vector. Infectious transcripts were synthesized from KpnI-linearized pBGC291 using T7 RNA polymerase and cap (7mGpppG) according to the manufacturer (New England Biolabs). An increased quantity of recombinant virus was obtained by passaging and purifying Sample ID No. TMV291.1B1 as described in example 1. Twenty tobacco plants were grown for 29 days post inoculation, accumulating 1060 g fresh weight of harvested leaf biomass not including the two lower inoculated leaves. Purified Sample ID TMV291.1B2 was recovered (474 mg) at a yield of 0.4 mg virion per gram of fresh weight. Therefore, 25 μg of 12-mer peptide was obtained per gram of fresh weight extracted. Tobacco plants infected with TMV291 accumulated greater than 21 micromoles of peptide per kilogram of leaf tissue.

[0049] Product analysis. The conformation of the epitope AGDR contained in the virus TMV291 is specifically recognized by the monoclonal antibody NVS3 in ELISA assays (FIG. 4). By Western blot analysis, NVS3 cross-reacted only with the TMV291 cp fusion at 18.6 kD and did not cross-react with the wild type or cp fusion present in TMV261. The genomic sequence of the epitope coding region was confirmed by directly sequencing viral RNA extracted from Sample ID No. TMV291.1B2.

Example 3

[0050] Production of a malarial B-cell epitope genetically fused to the C terminus of the TMVCP

[0051] Significant progress has been made in designing effective subunit vaccines using rodent models of malarial disease caused by nonhuman pathogens such as P. yoelii or P. berghei. The monoclonal antibody NYS 1 recognizes the repeating epitope QGPGAP (SEQ ID NO: 18), present on the CS protein of P. yoelii, and provides a very high level of immunity to sporozoite challenge when passively transferred to mice (Charoenvit, Y., et al. 1991b). Construction of a genetically modified tobamovirus designed to carry this malarial B-cell epitope fused to the surface of virus particles is set forth herein.

[0052] Construction of plasmid pBGC261. A 0.5 kb fragment of pBGC11, was PCR amplified using the 5′ primer TB2ClaI5′ (SEQ ID NO: 3) and the 3′ primer C/0AvrII (SEQ ID NO: 5). The amplified product was cloned into the Smal site of pBstKS+(Stratagene Cloning Systems) to form pBGC218.

[0053] pBGC219 was formed by cloning the 0.15 kb AccI-NsiI fragment of pBGC218 into pBGC235. A 0.05 kb synthetic AvrII fragment was formed by annealing PYCS.1p (SEQ ID NO: 8) with PYCS.1m (SEQ ID NO: 9) and the resulting double stranded fragment, encoding the leaky-stop signal and the P. yoelii B-cell malarial epitope, was cloned into the AvrII site of pBGC219 to form pBGC221. The 1.0 kb NcoI-KpnI fragment of pBGC221 was cloned into pBGC152 to form pBGC261.

[0054] The virus TMV261, produced by transcription of plasmid pBGC261 in vitro, contains a leaky stop signal at the C terminus of the coat protein gene and is therefore predicted to synthesize wild type and recombinant coat proteins at a ratio of 20:1. The recombinant TMVCP fusion synthesized by TMV261 is listed in (SEQ ID NO: 19) with the stop codon decoded as the amino acid Y (amino acid residue 160). The wild type sequence, synthesized by the same virus, is listed in (SEQ ID NO: 21). The epitope (QGPGAP)2 is calculated to be present at 0.3% of the weight of the virion.

[0055] Propagation and purification of the epitope expression vector. Infectious transcripts were synthesized from KpnI-linearized pBGC261 using SP6 RNA polymerase and cap (7mGpppG) according to the manufacturer (Gibco/BRL Life Technologies).

[0056] An increased quantity of recombinant virus was obtained by passaging and purifying Sample ID No. TMV261.B1b as described in example 1. Six tobacco plants were grown for 27 days post inoculation, accumulating 205 g fresh weight of harvested leaf biomass not including the two lower inoculated leaves. Purified Sample ID No. TMV261.1B2 was recovered (252 mg) at a yield of 1.2 mg virion per gram of fresh weight. Therefore, 4 μg of 12-mer peptide was obtained per gram of fresh weight extracted. Tobacco plants infected with TMV261 accumulated greater than 3.9 micromoles of peptide per kilogram of leaf tissue.

[0057] Product analysis. The content of the epitope QGPGAP in the virus TMV261 was determined by ELISA with monoclonal antibody NYS1 (FIG. 5). From the titration curve, 50 ug/ml of TMV261 gave the same O.D. reading (1.0) as 0.2 ug/ml of (QGPGAP)2. The measured value of approximately 0.4% of the weight of the virion as epitope is in good agreement with the calculated value of 0.3%. By Western blot analysis, NYS1 cross-reacted only with the TMV261 cp fusion at 19 kD and did not cross-react with the wild type cp or cp fusion present in TMV291. The genomic sequence of the epitope coding region was confirmed by directly sequencing viral RNA extracted from Sample ID. No. TMV261.1B2.

Example 4

[0058] Production of a malarial CTL epitope genetically fused to the C terminus of the TMVCP

[0059] Malarial immunity induced in mice by irradiated sporozoites of P. yoelii is also dependent on CD8+ T lymphocytes. Clone B is one cytotoxic T lymphocyte (CTL) cell clone shown to recognize an epitope present in both the P. yoelii and P. berghei CS proteins. Clone B recognizes the following amino acid sequence; SYVPSAEQILEFVKQISSQ (SEQ ID NO: 23) and when adoptively transferred to mice protects against infection from both species of malaria sporozoites (Weiss et al., 1992). Construction of a genetically modified tobamovirus designed to carry this malarial CTL epitope fused to the surface of virus particles is set forth herein.

[0060] Construction of plasmid pBGC289. A 0.5 kb fragment of pBGC11 was PCR amplified using the 5′ primer TB2CIaI5′ (SEQ ID NO: 3) and the 3′ primer C/−5AvrII (SEQ ID NO: 10). The amplified product was cloned into the SmaI site of pBstKS+ (Stratagene Cloning Systems) to form pBGC214.

[0061] pBGC215 was formed by cloning the 0.15 kb AccI-NsiI fragment of pBGC214 into pBGC235. The 0.9 kb NcoI-KpnI fragment from pBGC215 was cloned into pBGC152 to form pBGC216.

[0062] A 0.07 kb synthetic fragment was formed by annealing PYCS.2p (SEQ ID NO: 11) with PYCS.2m (SEQ ID NO: 12) and the resulting double stranded fragment, encoding the P. yoelii CTL malarial epitope, was cloned into the AvrII site of pBGC215 made blunt ended by treatment with mung bean nuclease and creating a unique AatII site, to form pBGC262. A 0.03 kb synthetic AatII fragment was formed by annealing TLS.1EXP (SEQ ID NO: 13) with TLS.1EXM (SEQ ID NO: 14) and the resulting double stranded fragment, encoding the leaky-stop sequence and a stuffer sequence used to facilitate cloning, was cloned into AatII digested pBGC262 to form pBGC263. pBGC262 was digested with AatII and ligated to itself removing the 0.02 kb stuffer fragment to form pBGC264. The 1.0 kb NcoI-KpnI fragment of pBGC264 was cloned into pSNC004 to form pBGC289.

[0063] The virus TMV289 produced by transcription of plasmid pBGC289 in vitro, contains a leaky stop signal resulting in the removal of four amino acids from the C terminus of the wild type TMV coat protein gene and is therefore predicted to synthesize a truncated coat protein and a coat protein with a CTL epitope fused at the C terminus at a ratio of 20:1. The recombinant TMVCP/CTL epitope fusion present in TMV289 is listed in SEQ ID NO: 25 with the stop codon decoded as the amino acid Y (amino acid residue 156). The wild type sequence minus four amino acids from the C terminus is listed in SEQ ID NO: 26. The amino acid sequence of the coat protein of virus TMV216 produced by transcription of the plasmid pBGC216 in vitro, is also truncated by four amino acids. The epitope SYVPSAEQILEFVKQISSQ (SEQ ID NO:23) is calculated to be present at approximately 0.5% of the weight of the virion using the same assumptions confirmed by quantitative ELISA analysis of the read through properties of TMV261 in example 3.

[0064] Propagation and purification of the epitope expression vector. Infectious transcripts were synthesized from KpnI-linearized pBGC289 using T7 RNA polymerase and cap (7mGpppG) according to the manufacturer (New England Biolabs). An increased quantity of recombinant virus was obtained by passaging Sample ID No. TMV289.11B1a as described in example 1. Fifteen tobacco plants were grown for 33 days post inoculation accumulating 595 g fresh weight of harvested leaf biomass not including the two lower inoculated leaves. Purified Sample ID. No. TMV289.11B2 was recovered (383 mg) at a yield of 0.6 mg virion per gram of fresh weight. Therefore, 3 μg of 19-mer peptide was obtained per gram of fresh weight extracted. Tobacco plants infected with TMV289 accumulated greater than 1.4 micromoles of peptide per kilogram of leaf tissue.

[0065] Product analysis. Partial confirmation of the sequence of the epitope coding region of TMV289 was obtained by restriction digestion analysis of PCR amplified cDNA using viral RNA isolated from Sample ID. No. TMV289.11B2. The presence of proteins in TMV289 with the predicted mobility of the cp fusion at 20 kD and the truncated cp at 17.1 kD was confirmed by denaturing polyacrylamide gel electrophoresis.

[0066] LITERATURE CITED

[0067] Ahlquist, P. G., and French, R. C. 1986. RNA transformation vector. European Patent Appl. 194,809.

[0068] Bruening, G., 1978. Comovirus group, C.M.I./A.A.B. Descriptions of plant viruses, No. 199. Wm. Culross and Son Ltd., Coupar Angus, Perthshire, Scotland.

[0069] Butler, P. J. G., Mayo, M. A. 1987. Molecular architecture and assembly of tobacco mosaic virus particles, The molecular biology of the positive strand RNA viruses. (D. J. Rowlands, M. A. Mayo, and B. W. J. Mahy, eds.), Academic Press, London. pp. 237-257.

[0070] Charoenvit, Y., Collins, W. E., Jones, T. R., Millet, P., Yuan, L., Beaudoin, R. L., Broderson, J. R., and Hoffman, S. L. 1991a. Inability of malaria vaccine to induce antibodies to a protective epitope within its sequence. Science 251:668-671.

[0071] Charoenvit, Y., Mellouk, S., Cole, C., Bechara, R., Leef, M. F., Sedegah, M., Yuan, L., Robey, F. A., Beaudoin, R. L., and Hoffman, S. L. 1991b. Monoclonal, but not polyclonal, antibodies protect against Plasmodium yoelii sporozoites. J. Immunol. 146:1020-1025.

[0072] Dawson, W. O., Beck, D. L., Knorr, D. A., and Grantham, G. L. 1986. cDNA cloning of the complete genome of tobacco mosaic virus and production of infectious transcripts. Proc. Natl. Acad. Sci. USA 83:1832-1836.

[0073] Dawson, W. O., Bubrick, P., and Grantham, G. L. 1988. Modifications of the tobacco mosaic virus coat protein gene affecting replication, movement, and symptomatology. Phytopathology 78:783-789.

[0074] Dawson, W. O., Lewandowski, D. J., Hilf, M. E., Bubrick, P., Raffo, A. J., Shaw, J. J., Grantham, G. L., and Desjardins, P. R. 1989. A tobacco mosaic virus-hybrid expresses and loses an added gene. Virology 172:285-292.

[0075] Donson, J., Kearney, C. M., Hilf, M. E., and Dawson, W. 0. 1991. Systemic expression of a bacterial gene by a tobacco mosaic virus-based vector. Proc. Natl. Acad. Sci. USA 88:7204-7208.

[0076] Donson, J., Dawson, W. O., Grantham, G. L., Turpen, T. H., Turpen, A. M., Garger, S. J., and Grill, L. K. 1992. Recombinant viral vectors having heterologous subgenomic promoters for systemic expression of foreign genes. U.S. patent appl. Ser. No. 923,692.

[0077] French, R., Janda, M., and Ahlquist, P. 1986. Bacterial gene inserted in an engineered RNA virus: Efficient expression in monocotyledonous plant cells. Science 231:1294-1297.

[0078] Gibbs, A. J. 1977. Tobamovirus group, C.M.I./A.A.B. Descriptions of plant viruses, No. 184. Wm. Culross and Son Ltd., Coupar Angus, Perthshire, Scotland.

[0079] Goelet, P., Lomonossoff, G. P., Butler P. J. G., Akam, M. E., and Kam, J. 1982. Nucleotide sequence of tobacco mosaic virus RNA. Proc. Natl. Acad. Sci. USA 79:5818-5822.

[0080] Gooding, Jr., G. V., and Hebert, T. T. 1967. A simple technique for purification of tobacco mosaic virus in large quantities. Phytopathology 57:1285.

[0081] Hamamoto, H., Hashida, E., Matsunaga, Y., Nakagawa, N., Nakanishi, N., Okada, Y., Sugiyama, Y., and Tsuchimoto, S. 1993a. Plant virus vector for foreign gene expression-contains foreign gene down stream of viral coat protein gene, linked by read-through sequence. PCT Patent Application WO 93/JP408.

[0082] Hamamoto, H., Sugiyama, Y., Nakagawa, N., Hashida, E., Matsunaga, Y., Takemoto, S., Watanabe Y., and Okada, Y. 1993b. A new tobacco mosaic virus vector and its use for the systemic production of angiotensin-I-converting enzyme inhibitor in transgenic tobacco and tomato. Bio/Technology 11:930-932.

[0083] Haynes, J. R., Cunningham, J., von Seefried, A., Lennick, M., Garvin, R. T., and Shen, S.-H. 1986. Development of a genetically-engineered, candidate polio vaccine employing the self-assembling properties of the tobacco mosaic virus coat protein. Bio/Technology 4:637-641.

[0084] James, E. A., Garvin, R. T., and Haynes, J. R. 1985. Multispecific immunogenic proteins. European Patent Application, 174,759.

[0085] Krebbers, E., Bosch, D., and Vandekerckhove, J. 1992. Prospects and progress in the production of foreign proteins and peptides in plants, Plant Protein Engineering. (P. R. Shewry and S. Gutteridge, eds.), Cambridge University Press, Cambridge. pp. 316-324.

[0086] Kumagai, M. H., Turpen, T. H., Weinzettl, N., della-Cioppa, G., Turpen, A. M., Donson, J., Hilf, M. E., Grantham, G. L., Dawson, W. O., Chow, T. P., Piatak Jr., M., and Grill, L. K. 1993. Rapid, high level expression of biologically active α-trichosanthin in transfected plants by a novel RNA viral vector. Proc. Natl. Acad. Sci. USA 90:427-430.

[0087] Lomonossoff, G. P., and Johnson, J. E. 1992. Modified plant viruses as vectors. PCT Application WO 92/18618.

[0088] Mason, H. S., Lam, D. M-K., and Arntzen, C. J. 1992. Expression of hepatitis B surface antigen in transgenic plants. Proc. Natl. Acad. Sci. USA 89:11745-11749.

[0089] Okada, Y., and Han, K. 1986. Plant virus RNA vector. Japanese Patent Application 61/158443.

[0090] Okada, Y., and Takamatsu, N. 1988. A plant virus RNA vector. Japanese Patent Application 63/200789.

[0091] Pelcher, L. E., Halasa, M. C. 1982. An RNA plant virus vector or portion thereof, a method of construction thereof, and a method of producing a gene derived product therefrom. European Patent Appl. 067,553.

[0092] Pelham, H. R. B. 1978. Leaky UAG termination codon in tobacco mosaic virus RNA. Nature 272:469-471.

[0093] Skuzeski, J. M., Nichols, L. M., Gesteland, R. F., and Atkins, J. F. 1991. The signal for a leaky UAG stop codon in several plant viruses includes the two downstream codons. J. Mol. Biol. 218:365-373.

[0094] Takamatsu, N., Ishikawa, M., Meshi, T., and Okada, Y. 1987. Expression of bacterial chloramphenicol acetyltransferase gene in tobacco plants mediated by TMV-RNA. EMBO J. 6:307-311.

[0095] Takamatsu, N., Watanabe, Y., Yanagi, H., Meshi, T., Shiba, T., and Okada, Y. 1990. Production of enkephalin in tobacco protoplasts using tobacco mosaic virus RNA vector. FEBS Lett. 269:73-76.

[0096] Turpen, T. H., and Grill, L. K. Apr. 4, 1989. New products through viral coat protein modification. Biosource Genetics Corporation, Record of Invention, First Written Disclosure.

[0097] Usha, R., Rohll, J. B., Spall, V. E., Shanks, M., Maule, A. J., Johnson, J. E., and Lomonossoff, G. P. 1993. Expression of an animal virus antigenic site on the surface of a plant virus particle. Virology 197:366-374.

[0098] van Kammen, A., and de Jager, C. P. 1978. Cowpea mosaic virus, C.M.I./A.A.B. Descriptions of plant viruses, No. 197. Wm. Culross and Son Ltd., Coupar Angus, Perthshire, Scotland.

[0099] Weiss, W. R., Berzofsky, J. A., Houghten, R. A., Sedegah, M., Hollindale, M., and Hoffman, S. L. 1992. A T cell clone directed at the circumsporozoite protein which protects mice against both Plasmodium yoelii and Plasmodium berghei. J. Immunol. 149:2103-2109.

[0100] Zaitlin, M., and Israel, H. W. 1975. Tobacco mosaic virus (type strain), C.M.I./A.A.B. Descriptions of plant viruses, No. 151. Wm. Culross and Son Ltd., Coupar Angus, Perthshire, Scotland.

[0101] Incorporation by Reference

[0102] All patents, patents applications, and publications cited are incorporated herein by reference.

[0103] Equivalents

[0104] The foregoing written specification is considered to be sufficient to enable one skilled in the art to practice the invention. Indeed, various modifications of the above-described makes for carrying out the invention which are obvious to those skilled in the field of molecular biology or related fields are intended to be within the scope of the following claims.

1 27 49 base pairs nucleic acid unknown unknown DNA (genomic) 1 GGAATTCAAG CTTAATACGA CTCACTATAG TATTTTTACA ACAATTACC 49 18 base pairs nucleic acid unknown unknown DNA (genomic) 2 CCTTCATGTA AACCTCTC 18 25 base pairs nucleic acid unknown unknown DNA (genomic) 3 TAATCGATGA TGATTCGGAG GCTAC 25 36 base pairs nucleic acid unknown unknown DNA (genomic) 4 AAAGTCTCTG TCTCCTGCAG GGAACCTAAC AGTTAC 36 36 base pairs nucleic acid unknown unknown DNA (genomic) 5 ATTATGCATC TTGACTACCT AGGTTGCAGG ACCAGA 36 24 base pairs nucleic acid unknown unknown DNA (genomic) 6 GGCGATCGGG CTGGTGACCG TGCA 24 24 base pairs nucleic acid unknown unknown DNA (genomic) 7 CGGTCACCAG CCCGATCGCC TGCA 24 45 base pairs nucleic acid unknown unknown DNA (genomic) 8 CTAGCAATTA CAAGGTCCAG GTGCACCTCA AGGTCCTGGA GCTCC 45 45 base pairs nucleic acid unknown unknown DNA (genomic) 9 CTAGGGAGCT CCAGGACCTT GAGGTGCACC TGGACCTTGT AATTG 45 35 base pairs nucleic acid unknown unknown DNA (genomic) 10 ATTATGCATC TTGACTACCT AGGTCCAAAC CAAAC 35 66 base pairs nucleic acid unknown unknown DNA (genomic) 11 GTCATATGTT CCATCTGCAG AGCAGATCTT GGAATTCGTT AAGCAAATCT CGAGTCAGTA 60 ACTATA 66 66 base pairs nucleic acid unknown unknown DNA (genomic) 12 TATAGTTACT GACTCGAGAT TTGCTTAACG AATTCCAAGA TCTGCTCTGC AGATGGAACA 60 TATGAC 66 33 base pairs nucleic acid unknown unknown DNA (genomic) 13 CGACCTAGGT GATGACGTCA TAGCAATTAA CGT 33 33 base pairs nucleic acid unknown unknown DNA (genomic) 14 TAATTGCTAT GACGTCATCA CCTAGGTCGA CGT 33 4 amino acids amino acid unknown unknown peptide 15 Ala Gly Asp Arg 1 510 base pairs nucleic acid unknown unknown DNA (genomic) pBGC291 Fusion CDS 1..510 16 ATG TCT TAC AGT ATC ACT ACT CCA TCT CAG TTC GTG TTC TTG TCA TCA 48 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 GCG TGG GCC GAC CCA ATA GAG TTA ATT AAT TTA TGT ACT AAT GCC TTA 96 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 GGA AAT CAG TTT CAA ACA CAA CAA GCT CGA ACT GTC GTT CAA AGA CAA 144 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 TTC AGT GAG GTG TGG AAA CCT TCA CCA CAA GTA ACT GTT AGG TTC CCT 192 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 GCA GGC GAT CGG GCT GGT GAC CGT GCA GGA GAC AGA GAC TTT AAG GTG 240 Ala Gly Asp Arg Ala Gly Asp Arg Ala Gly Asp Arg Asp Phe Lys Val 65 70 75 80 TAC AGG TAC AAT GCG GTA TTA GAC CCG CTA GTC ACA GCA CTG TTA GGT 288 Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu Val Thr Ala Leu Leu Gly 85 90 95 GCA TTC GAC ACT AGA AAT AGA ATA ATA GAA GTT GAA AAT CAG GCG AAC 336 Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu Val Glu Asn Gln Ala Asn 100 105 110 CCC ACG ACT GCC GAA ACG TTA GAT GCT ACT CGT AGA GTA GAC GAC GCA 384 Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr Arg Arg Val Asp Asp Ala 115 120 125 ACG GTG GCC ATA AGG AGC GCG ATA AAT AAT TTA ATA GTA GAA TTG ATC 432 Thr Val Ala Ile Arg Ser Ala Ile Asn Asn Leu Ile Val Glu Leu Ile 130 135 140 AGA GGA ACC GGA TCT TAT AAT CGG AGC TCT TTC GAG AGC TCT TCT GGT 480 Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser Phe Glu Ser Ser Ser Gly 145 150 155 160 TTG GTT TGG ACC TCT GGT CCT GCA ACT TGA 510 Leu Val Trp Thr Ser Gly Pro Ala Thr 165 170 169 amino acids amino acid linear protein pBGC291 Fusion 17 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 Ala Gly Asp Arg Ala Gly Asp Arg Ala Gly Asp Arg Asp Phe Lys Val 65 70 75 80 Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu Val Thr Ala Leu Leu Gly 85 90 95 Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu Val Glu Asn Gln Ala Asn 100 105 110 Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr Arg Arg Val Asp Asp Ala 115 120 125 Thr Val Ala Ile Arg Ser Ala Ile Asn Asn Leu Ile Val Glu Leu Ile 130 135 140 Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser Phe Glu Ser Ser Ser Gly 145 150 155 160 Leu Val Trp Thr Ser Gly Pro Ala Thr 165 6 amino acids amino acid unknown unknown peptide 18 Gln Gly Pro Gly Ala Pro 1 5 525 base pairs nucleic acid unknown unknown DNA (genomic) pBGC261 Leaky Stop CDS 1..525 19 ATG TCT TAC AGT ATC ACT ACT CCA TCT CAG TTC GTG TTC TTG TCA TCA 48 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 GCG TGG GCC GAC CCA ATA GAG TTA ATT AAT TTA TGT ACT AAT GCC TTA 96 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 GGA AAT CAG TTT CAA ACA CAA CAA GCT CGA ACT GTC GTT CAA AGA CAA 144 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 TTC AGT GAG GTG TGG AAA CCT TCA CCA CAA GTA ACT GTT AGG TTC CCT 192 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 GAC AGT GAC TTT AAG GTG TAC AGG TAC AAT GCG GTA TTA GAC CCG CTA 240 Asp Ser Asp Phe Lys Val Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu 65 70 75 80 GTC ACA GCA CTG TTA GGT GCA TTC GAC ACT AGA AAT AGA ATA ATA GAA 288 Val Thr Ala Leu Leu Gly Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu 85 90 95 GTT GAA AAT CAG GCG AAC CCC ACG ACT GCC GAA ACG TTA GAT GCT ACT 336 Val Glu Asn Gln Ala Asn Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr 100 105 110 CGT AGA GTA GAC GAC GCA ACG GTG GCC ATA AGG AGC GCG ATA AAT AAT 384 Arg Arg Val Asp Asp Ala Thr Val Ala Ile Arg Ser Ala Ile Asn Asn 115 120 125 TTA ATA GTA GAA TTG ATC AGA GGA ACC GGA TCT TAT AAT CGG AGC TCT 432 Leu Ile Val Glu Leu Ile Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser 130 135 140 TTC GAG AGC TCT TCT GGT TTG GTT TGG ACC TCT GGT CCT GCA ACC TAG 480 Phe Glu Ser Ser Ser Gly Leu Val Trp Thr Ser Gly Pro Ala Thr Tyr 145 150 155 160 CAA TTA CAA GGT CCA GGT GCA CCT CAA GGT CCT GGA GCT CCC TAG 525 Gln Leu Gln Gly Pro Gly Ala Pro Gln Gly Pro Gly Ala Pro 165 170 175 174 amino acids amino acid linear protein pBGC261 Leaky Stop 20 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 Asp Ser Asp Phe Lys Val Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu 65 70 75 80 Val Thr Ala Leu Leu Gly Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu 85 90 95 Val Glu Asn Gln Ala Asn Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr 100 105 110 Arg Arg Val Asp Asp Ala Thr Val Ala Ile Arg Ser Ala Ile Asn Asn 115 120 125 Leu Ile Val Glu Leu Ile Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser 130 135 140 Phe Glu Ser Ser Ser Gly Leu Val Trp Thr Ser Gly Pro Ala Thr Tyr 145 150 155 160 Gln Leu Gln Gly Pro Gly Ala Pro Gln Gly Pro Gly Ala Pro 165 170 480 base pairs nucleic acid unknown unknown DNA (genomic) pBGC261 Nonfusion CDS 1..480 21 ATG TCT TAC AGT ATC ACT ACT CCA TCT CAG TTC GTG TTC TTG TCA TCA 48 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 GCG TGG GCC GAC CCA ATA GAG TTA ATT AAT TTA TGT ACT AAT GCC TTA 96 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 GGA AAT CAG TTT CAA ACA CAA CAA GCT CGA ACT GTC GTT CAA AGA CAA 144 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 TTC AGT GAG GTG TGG AAA CCT TCA CCA CAA GTA ACT GTT AGG TTC CCT 192 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 GAC AGT GAC TTT AAG GTG TAC AGG TAC AAT GCG GTA TTA GAC CCG CTA 240 Asp Ser Asp Phe Lys Val Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu 65 70 75 80 GTC ACA GCA CTG TTA GGT GCA TTC GAC ACT AGA AAT AGA ATA ATA GAA 288 Val Thr Ala Leu Leu Gly Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu 85 90 95 GTT GAA AAT CAG GCG AAC CCC ACG ACT GCC GAA ACG TTA GAT GCT ACT 336 Val Glu Asn Gln Ala Asn Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr 100 105 110 CGT AGA GTA GAC GAC GCA ACG GTG GCC ATA AGG AGC GCG ATA AAT AAT 384 Arg Arg Val Asp Asp Ala Thr Val Ala Ile Arg Ser Ala Ile Asn Asn 115 120 125 TTA ATA GTA GAA TTG ATC AGA GGA ACC GGA TCT TAT AAT CGG AGC TCT 432 Leu Ile Val Glu Leu Ile Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser 130 135 140 TTC GAG AGC TCT TCT GGT TTG GTT TGG ACC TCT GGT CCT GCA ACC TAG 480 Phe Glu Ser Ser Ser Gly Leu Val Trp Thr Ser Gly Pro Ala Thr 145 150 155 160 159 amino acids amino acid linear protein pBGC261 Nonfusion 22 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 Asp Ser Asp Phe Lys Val Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu 65 70 75 80 Val Thr Ala Leu Leu Gly Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu 85 90 95 Val Glu Asn Gln Ala Asn Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr 100 105 110 Arg Arg Val Asp Asp Ala Thr Val Ala Ile Arg Ser Ala Ile Asn Asn 115 120 125 Leu Ile Val Glu Leu Ile Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser 130 135 140 Phe Glu Ser Ser Ser Gly Leu Val Trp Thr Ser Gly Pro Ala Thr 145 150 155 19 amino acids amino acid unknown unknown DNA (genomic) 23 Ser Tyr Val Pro Ser Ala Glu Gln Ile Leu Glu Phe Val Lys Gln Ile 1 5 10 15 Ser Ser Gln 537 base pairs nucleic acid unknown unknown DNA (genomic) pBGC289 Leaky Stop CDS 1..537 24 ATG TCT TAC AGT ATC ACT ACT CCA TCT CAG TTC GTG TTC TTG TCA TCA 48 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 GCG TGG GCC GAC CCA ATA GAG TTA ATT AAT TTA TGT ACT AAT GCC TTA 96 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 GGA AAT CAG TTT CAA ACA CAA CAA GCT CGA ACT GTC GTT CAA AGA CAA 144 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 TTC AGT GAG GTG TGG AAA CCT TCA CCA CAA GTA ACT GTT AGG TTC CCT 192 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 GAC AGT GAC TTT AAG GTG TAC AGG TAC AAT GCG GTA TTA GAC CCG CTA 240 Asp Ser Asp Phe Lys Val Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu 65 70 75 80 GTC ACA GCA CTG TTA GGT GCA TTC GAC ACT AGA AAT AGA ATA ATA GAA 288 Val Thr Ala Leu Leu Gly Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu 85 90 95 GTT GAA AAT CAG GCG AAC CCC ACG ACT GCC GAA ACG TTA GAT GCT ACT 336 Val Glu Asn Gln Ala Asn Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr 100 105 110 CGT AGA GTA GAC GAC GCA ACG GTG GCC ATA AGG AGC GCG ATA AAT AAT 384 Arg Arg Val Asp Asp Ala Thr Val Ala Ile Arg Ser Ala Ile Asn Asn 115 120 125 TTA ATA GTA GAA TTG ATC AGA GGA ACC GGA TCT TAT AAT CGG AGC TCT 432 Leu Ile Val Glu Leu Ile Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser 130 135 140 TTC GAG AGC TCT TCT GGT TTG GTT TGG ACG TCA TAG CAA TTA ACG TCA 480 Phe Glu Ser Ser Ser Gly Leu Val Trp Thr Ser Tyr Gln Leu Thr Ser 145 150 155 160 TAT GTT CCA TCT GCA GAG CAG ATC TTG GAA TTC GTT AAG CAA ATC TCG 528 Tyr Val Pro Ser Ala Glu Gln Ile Leu Glu Phe Val Lys Gln Ile Ser 165 170 175 AGT CAG TAG 537 Ser Gln 178 amino acids amino acid linear protein pBGC289 Leaky Stop 25 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 Asp Ser Asp Phe Lys Val Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu 65 70 75 80 Val Thr Ala Leu Leu Gly Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu 85 90 95 Val Glu Asn Gln Ala Asn Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr 100 105 110 Arg Arg Val Asp Asp Ala Thr Val Ala Ile Arg Ser Ala Ile Asn Asn 115 120 125 Leu Ile Val Glu Leu Ile Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser 130 135 140 Phe Glu Ser Ser Ser Gly Leu Val Trp Thr Ser Tyr Gln Leu Thr Ser 145 150 155 160 Tyr Val Pro Ser Ala Glu Gln Ile Leu Glu Phe Val Lys Gln Ile Ser 165 170 175 Ser Gln 468 base pairs nucleic acid unknown unknown DNA (genomic) pBGC289 Non-fusion CDS 1..468 26 ATG TCT TAC AGT ATC ACT ACT CCA TCT CAG TTC GTG TTC TTG TCA TCA 48 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 GCG TGG GCC GAC CCA ATA GAG TTA ATT AAT TTA TGT ACT AAT GCC TTA 96 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 GGA AAT CAG TTT CAA ACA CAA CAA GCT CGA ACT GTC GTT CAA AGA CAA 144 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 TTC AGT GAG GTG TGG AAA CCT TCA CCA CAA GTA ACT GTT AGG TTC CCT 192 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 GAC AGT GAC TTT AAG GTG TAC AGG TAC AAT GCG GTA TTA GAC CCG CTA 240 Asp Ser Asp Phe Lys Val Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu 65 70 75 80 GTC ACA GCA CTG TTA GGT GCA TTC GAC ACT AGA AAT AGA ATA ATA GAA 288 Val Thr Ala Leu Leu Gly Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu 85 90 95 GTT GAA AAT CAG GCG AAC CCC ACG ACT GCC GAA ACG TTA GAT GCT ACT 336 Val Glu Asn Gln Ala Asn Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr 100 105 110 CGT AGA GTA GAC GAC GCA ACG GTG GCC ATA AGG AGC GCG ATA AAT AAT 384 Arg Arg Val Asp Asp Ala Thr Val Ala Ile Arg Ser Ala Ile Asn Asn 115 120 125 TTA ATA GTA GAA TTG ATC AGA GGA ACC GGA TCT TAT AAT CGG AGC TCT 432 Leu Ile Val Glu Leu Ile Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser 130 135 140 TTC GAG AGC TCT TCT GGT TTG GTT TGG ACG TCA TAG 468 Phe Glu Ser Ser Ser Gly Leu Val Trp Thr Ser 145 150 155 155 amino acids amino acid linear protein pBGC289 Non-fusion 27 Met Ser Tyr Ser Ile Thr Thr Pro Ser Gln Phe Val Phe Leu Ser Ser 1 5 10 15 Ala Trp Ala Asp Pro Ile Glu Leu Ile Asn Leu Cys Thr Asn Ala Leu 20 25 30 Gly Asn Gln Phe Gln Thr Gln Gln Ala Arg Thr Val Val Gln Arg Gln 35 40 45 Phe Ser Glu Val Trp Lys Pro Ser Pro Gln Val Thr Val Arg Phe Pro 50 55 60 Asp Ser Asp Phe Lys Val Tyr Arg Tyr Asn Ala Val Leu Asp Pro Leu 65 70 75 80 Val Thr Ala Leu Leu Gly Ala Phe Asp Thr Arg Asn Arg Ile Ile Glu 85 90 95 Val Glu Asn Gln Ala Asn Pro Thr Thr Ala Glu Thr Leu Asp Ala Thr 100 105 110 Arg Arg Val Asp Asp Ala Thr Val Ala Ile Arg Ser Ala Ile Asn Asn 115 120 125 Leu Ile Val Glu Leu Ile Arg Gly Thr Gly Ser Tyr Asn Arg Ser Ser 130 135 140 Phe Glu Ser Ser Ser Gly Leu Val Trp Thr Ser 145 150 155 

What is claimed is:
 1. A polynulceotide encoding fusion protein, the fusion protein consisting essentially of a tobamovirus coat protein fused to a protein of interest at a fusion joint.
 2. A polynucleotide according to claim 1, wherein the fusion is an amino terminus fusion.
 3. A polynucleotide according to claim 1, wherein the fusion is a carboxy terminus fusion.
 4. A polynucleotide according to claim 1, wherein the fusion is an internal fusion.
 5. A polynucleotide according to claim 1, wherein the fusion joint comprises a leaky stop codon.
 6. A polynucleotide according to claim 1, wherein the fusion joint comprises a leaky start codon.
 7. A polynucleotide according to claim 1, wherein the protein of interest is an antigen.
 8. A polynucleotide according to claim 1, wherein the coat protein is a tobacco mosaic virus coat protein.
 9. A recombinant plant viral genome comprising a polynucleotide according to claim
 1. 10. A recombinant plant virus particle, comprising a genome according to claim
 9. 11. A polypeptide encoded by a polynucleotide according to claim
 1. 12. A recombinant plant virus, wherein the coat protein is encoded by a polynucleotide according to claim
 1. 13. A plant cell comprising of polynucleotide according to claim
 9. 